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Abstract. In a stationary ergodic process, clustering is defined as the tendency of 
events to appear in series of increased frequency separated by longer breaks. Such 
behavior, contradicting the theoretical "unbiased behavior" with exponential dis- 
tribution of the gaps between appearances, is commonly observed in experimental 
processes and often difficult to explain. In the last section we relate one such empiri- 
cal example of clustering, in the area of marine technology. In the theoretical part of 
the paper we prove, using ergodic theory and the notion of category, that clustering 
(even very strong) is in fact typical for "rare events" defined as long cylinder sets in 
processes generated by a finite partition of an arbitrary (infinite aperiodic) ergodic 
measure preserving transformation. 



Introduction: recurrence and clustering 

Let us consider an aperiodic ergodic dynamical system {X, B, fi, T), where (X, B, /i) 
is a standard probability space, and T : X ^ X is measurable, almost surely 1 — 1, 
preserves /i, for which it is an ergodic transformation [W]. 

Let us next consider a measurable partition V = {Pi, ... ,Pn} of the space X. 
To this partition we associate the natural symbolic factor of the system, using the 
coding map 

c : X e X ^ i e {1, . . . ,n} ^ X e Pi 
which generates the factor map [W] 

tt: X ^ {!,... ,n}^ 
X ^ {c{T"x))nez- 

Then ii i' — tth and a : {1, . . . , n}^ {1, • • ■ , n}^ denotes the left shift map, we 
obtain the factor tt : {X, B, fi,T) ^ {{1, . . . , n}^, C, v, a) which satisfies ttoT = aoir 
(the cr-algebra C is the obvious one) . 

This standard procedure, based on the selection of a partition on X , transforms 
the initial abstract dynamical system into a symbolic dynamical system. The coding 
procedure is somewhat natural in that it produces the accessible dynamics through 
the selection of a finite partition of the space of observables X. 
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On the resulting symbolic space we inherit of standard subsets of the phase 
space {1, . . . ,n}^, which we call cylinders, or blocks. A typical such, of length p, 
is obtained by selecting a pattern w G {1, . . . , n}^, and defining 



When the length p of the cylinder [w] increases, of course, the measure tends 
to 0, which results from aperiodicity of the dynamics. 

Whence long cylinder sets are prototypical rare events (i.e. small measure sets) 
in the symbolic dynamics we accessed by cross-ruling our initial space X with 
the partition V. The system is ergodic whence recurrence to any positive measure 
subset of X must occur, a.s., and happens along a typical trajectory with frequency 
equal to the measure of the set (this is the ergodic theorem). 

Our paper concerns the study of recurrence to rare events generated by partitions 
as above. If we set for given B G B with /x(-B) > 0, 



then tb is /z-a.s. well defined, integer valued. When a; G -B, it is called the return 

time of X to S, otherwise it is called the entry time of x to B. Kac's theorem 
[K] states that J2k>i ^I^H'^b = k} Ci B) = 1, whence the random variable ijl{B)tb 
defined on the probability space induced on B has expected value equal to 1. We 
therefore call ^{B)tb the "normalised return time" . 

For rare events such as cylinder sets the distribution of return times or entry 
times to such events, though by the ergodic theorem has on the overall frequential 
distribution, may change quite a lot depending on the dynamics of the system. 
What is oftenly looked after is weak convergence of such distributions as the mea- 
sure of the sets B (resp. lengths of the cyinders) shrink to (resp. go to oo). 
This is because if such convergence holds then the limiting distribution provides 
information about recurrence to rare events in the dynamical system, as it is an 
approximation of distributions converging to it weakly. 

Such a limit distribution is called an asymptotic for return times. We denote by 
Fb the distribution function of the normalised return time to B, and by Fb the 
distribution function of the entry time to B, i.e. of the random variable ^{B)tb- 
Following the same lines, asymptotics for entry times arc analogously defined. 

Possible asymptotics have been characterized only recently ([L] for return times, 
[K-L] for entry times). An integral formula connecting asymptotics for entry times 
and the one for return times has been provided in [H-L-V], where it is proved that 
weak convergence of distributions, whenever it holds, must hold simultaneously for 
entry and return times. If F is the weak limit distribution function for return times 
and F is the one for entry times, then additionally 



Many research papers have been devoted to the study of return times of specific 
events, so-called cylinder sets, in stationary ergodic processes. We refer also the 
reader to expository papers [C] and [A-G] for further information. Most asymptotics 
along cylinder sets were found in mixing enough dynamical systems [W], but in all 
cases were proved to be exponential with parameter one, the distribution of which 



[w] ={y€{l,...,n} : {yo, ■ ■ ■ ,t/p-i) = (wo, • • • ,Wp-i)}- 



Tsix) = min{A; > 1 : T'^x € B}, 
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is the only fixed point of (★). Furthermore, in the treated cases, entropy of the 
system was positive [W]. 

An essential progress in understanding some phenomena towards asymptotics 
in processes with positive entropy has been obtained recently in [D-L], where it 
is proved that whatever the system, as soon as its entropy is positive, then any 
asymptotic F for entry times along cylinder sets must satisfy 

F{t) < 1 - e"*, t > 0. 

This was interpreted as a first explanation to a complicated and misunderstood 
common-sense phenomenon known as "the law of series" : indeed in any ergodic 
system, if t > and > 0, if we define the variable 

I{x) = #{0 < n < : e B}, 

then invariance of the measure implies that 

E(/)«t, (★★★) 

with uniform accuracy [D]. Considering an independent symbolic process so as to 
be one for which most randomness occurs, and using the well-known fact that in 
such system the asymptotics along cylinder sets exist and are exponential, we are 
led to argue that a small cylinder set B has a tendency to occur presenting clusters 
more frequently than it would in an independent process, if given that B occurs, the 
conditional expectation of I increases, compared to what it is in the independent 
case. 

That is to say, B clusters (or appears "in series") in distribution, if 

E(J|J > 0) > Eind{Ilnd\Ilnd > 0), 

where the subscript Ind refers to the independent process. Now using and the 

fact that E(/|/ > 0) = E(/) / > 0), observing moreover that > 0) = Fgit), 
and finally approximating FB,ind{t) « 1 — e~*, we deduce 

B clusters ^ FB(t) < 1 - e"*. 

In [D-L] we introduced the notions of attracting, strong attracting, repelling, 
and neutral recurrences for a set B, in reference to the above explained comparison 
to the case of the independent process. Below is a purely naive illustration of the 
phenomena we are describing, interpreted as "along a typical orbit" . 



unbiased 
attracting 

strong attr. 



This is the interpretation along which it was stated in [D-L] that can be 
understood differently, saying that in positive entropy systems, clustering for rare 
events must be at least what it reveals to be in the neutral independent case. In 
other words, "laws of series" (more frequent clustering for rare events) correspond 
to the natural behaviour for positive entropy systems. 
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This was completed in [D-L] by the study of asymptotics along cylinder sots for 
varying partitions V of the space X, so as to understand what generically (in a 
Baire category setting) could happen to be the case (the set of partitions can be 
turned to a Polish structured space). 

It was proved that generically (on V) there exists an upper density one set 
A/" C N of lengths of cylinder sets along which asymptotics exist, and for entry times, 
converge to the degenerated distribution function F = 0, the one corresponding to 
strong attracting above (enormous clusters). 

In the present paper we go forward in this last direction and prove that this 
genericity holds without the positive entropy assumption. We also addressed the 
question to specialists, since the "law of scries" , frequently referred to imdcr more 
pessimistic interpretations like "Murphy laws" , is used by engineers and manufac- 
turers and formalised under the label "Murphy proof devices", in aeronautics for 
instance. In fact other occurrences of Murphy proof procedures appear, and we 
reveal such in the last part of this paper, were we relate to some rare but clustering 
disturbances that occur on low speed flights of underwater gliders, without further 
understanding at this point of knowledge. 

The only reasonable understanding for this generic clustering phenomenon we 
can talk out, at this point, more like a guess, is the observation that rare events, 
to occur, need favourable conditions, probably even rarer, and that therefore when 
they are collected, conditionally, the event has a better measure and comes through 
easier. 

The paper is organised as follows : the next section introduces rigourous for- 
mulations about genericity. The next section presents the formulation and proof 
of our main result (Theorem 1). In the formidatioii we use the notion of strong 
clustering explained above in the Introduction. The last section is a presentation of 
an empirical example of a strongly clustered process occurring in marine technol- 
ogy, preceded by a short description of the performed experiment and its technical 
background. 

Typicality of strong attraction without entropy assumptions 

Let us go back to the original dynamical system {X, B, ji, T) on which the ob- 
served symbolic dynamics is defined by the partition V. 

Denote by the collection of all measurable partitions V oi X into / cells. In 
general there is no canonical measure on the space The meaning of "almost 
surely" with respect to a random partition does not have a definite meaning. In- 
stead, we will adopt the topological approach, and "typicality" defined in terms of 
so-called category. 

Recall that in a complete metric space a subset is called residual if it contains 
a dense Gs set, equivalently, if its complement if of first category (i.e., is a count- 
able union of nowhere dense sets). The Baire category theorem asserts that the 
intersection of any countable collection of residual sets is still residual. Because 
similar property is enjoyed by sets of measure 1 in probability spaces, residual sets 
are considered a topological analog of sets of measure 1. Given a fixed residual set, 
its elements are referred to as "typical" . 

If {X, B, n) is a probability space, the Rokhlin metric endows ^ii with a structure 
of a complete metric space. The distance in this metric between two Z-element 
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partitions Q is defined as 

rf(p,Q) = inf ^/x(AA7r(A)), 

where tt ranges over all bijections from V to Q and A denotes the symmetric 

difference of sets. A partition Q which is very close to V in this distance may 
be considered a slight perturbation of V. Our theorem below applies to processes 
typical in the following sense : given an ergodic dynamical system {X, B, /it, T) and 
some I G N, the set of all /-clement partitions V oi X such that the generated 
process ({I,-- - , ^}^, C, ^'p, ct) satisfies the assertion of the theorem is residual in 
the Rokhlin metric in . 

We will also use the phrase that a property <I> holds for all blocks of "majority" 
of lengths. By this we mean that there exists a set N$ of upper density 1 such that 
if n G N$ then all blocks of of length n satisfy i>. Upper density is defined as 

, #(N$n[l,n]) 
£>(N$) = limsup -^^-^^ ^ ■ 

Formulation of the Theorem and the proof 

Below we give two formulations of the main result. The first one is short thanks 
to the terminology introduced above and appeals to the intuitive understanding 
of the subject. The latter formulation is the rigorous version without using the 
shortcut terminology. 

Theorem 1. In a typical ergodic process, for majority of lengths n, all rare ele- 
mentary events of length n reveal strong clustering. 

Theorem 1. Let {X,B, jJL,T) he an ergodic not periodic dynamical system. Fix 
some natural I > 2. Then in the space of all l-element measurable partitions of 
X endowed with the Rokhlin metric there exists a residual subset € such that for 
every V € € the generated process ({1, . . . , £}^, C, jy-p, c) has the following property 
There exists a set No CN of upper density one, such that for every e > there 
is He € N such that for every n G No, n > Ue and every block B of length n, the 
Fb{s) < e^. 

Proof. Fix e > and N & N. Suppose a partition V G ^ii satisfies the following 

property $<:,jv : 

"For every n € [A'', N"^] and every block B of length n holds FB{e) < e^." 

Clearly, if we perturb V very little, the property will still be satisfied. Thus ^e,N 
holds on an open set Cg^jv of partitions. Of course, the set 

N>1 

of partitions such that the same property holds for some N, is also open. The main 
effort in the proof will be to show that this set is also dense. Once this is done, the 
proof is complete, because then the dense Gs set £ of partitions which fulfill the 
hypothesis of the theorem is obtained by intersecting the sets £e over countably 
many parameters e converging to zero. Every element of this intersection satisfies 
^e,N for arbitrarily small e and some N depending on e. Clearly, N must grow 
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to infinity as e decreases to zero. Notice that for any infinite sequence of natural 
numbers N the set [J[N, N'^] has upper density 1 in N. 

It remains to prove the density of the open set C^. The proof (as most proofs 
of typicahty) may look a bit artificial, as it is done by perturbing an arbitrarily 
chosen partition P in a very specific way, so that a highly particular partition is 
created. This perturbation is then shown to belong to the set £e. It is important 
to realize that once the density is proved, the "largeness" properties of open dense 
sets imply that £e is represented in the vicinity of V by many more partitions, not 
only the artificially constructed perturbation. 

We begin with a technical lemma. 

Lemma 1. In every ergodic not periodic dynamical system, {X,B, iJ,,T), for each 
sufficiently large r € N there exists a "semiperiodic r-marker", i.e., a measurable 
set Fr such that the return time to assumes almost surely only two values: r 
and r + 1. 

Proof. Wc need a measurable and shift-invariant procedure dividing the trajectory 
of almost every point x ^ X into intervals of the two lengths r and r + 1. (Shift- 
invariant means that the division of the trajectory of Tx coincides with the division 
of trajectory of .t with shifted enumeration.) Once this is done, the set Fr is defined 
as the collection of all points which have a division marker at the coordinate zero. 

Using ergodicity and nonperiodicity (or the Rokhlin theorem, see [R]) it is very 
easy to construct a set of positive measure such that the return time to this set 
assumes almost surely only values larger than or equal to r^. The times of visits 
to this set divide each trajectory into intervals of lengths at least (see also [L]). 
Every such interval can be further divided into intervals of two lengths r and r + 1, 
and we can fix one such way for every length m > r^ (for example, we can choose 
the division which minimizes the number of longer intervals and all longer intervals 
appear to the right of the shorter ones). This (clearly measurable) technique divides 
almost every trajectory into desired pieces in a shift-invariant way. as required. □ 

We continue with the main proof; wc are proving that Q^e is dense. Fix any 
^-element partition V and some (5 > 0. We will construct a perturbation V' within 
the distance 5 from V , which belongs to C^jv for some N . Since the partition has 
at least two elements, we select two of them and label them and 1. Pick L so large 
such that there exist K = ^ different blocks W'^ {k = 1,2, . . . , K) of length 
none of them equal to 000 ... 0, each of measure at most (here we admit also 
blocks that do not appear in the system and hence have measure zero) . Denote by 
Wk the block IWj^lOOO ... (with ^ zeros) of length L. The blocks Wi,...,Wk 
also have measures at most jzk- Notice that any two (different or equal) blocks 
from this family never occur with an overlap. Let r be an integer larger than 
Let N be larger than 2r + 2. Every cylinder set of positive measure over a block 
_B of a length n > N decomposes into a finite number of sets depending on the 
positioning of the r-markers (there is at least one such marker in every occurrence 
of B). We denote these sets by Si (i = 1, 2, . . . ,j{B)). Let M be so large that all 
points X, except in a set Z whoso measure is smallc;r than |, satisfy the following: 
For every block B of positive measure and length between N and N"^, and for every 
i = 1, 2, . . . ,j{B) the orbit of x visits Bi between times and M at least | times. 
Now let ri ^ KM. 

At this point we modify the partition V as follows: In the 'P-name of (almost) 
every x the interval between two consecutive ri -markers splits into K intervals of 
length M (the last one may be of length M + 1). We call them "sectors". In the 
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sector number k {k = 1,2, . . . , K) we put the block Wk immediately to the right of 
every r-marker (replacing whatever was there), and next we replace all occurrences 
of all blocks Wj {j = 1,2, . . . , K) also by Wk- Because the blocks Wj do not 
overlap, such exchange happens on disjoint intervals. Notice that we have changed 
the partition V only on the r-marker set Fj. and its L consecutive preimages by 
T (jointly a set of measure |), and on the cylinders corresponding to the blocks 
Wi, . . . , Wk, and also their L preimages by T (jointly another set of measure |). 
Thus the Rokhlin distance between V and the modified partition 7-" is at most S. 

Let B' be an arbitrary block of some length n between N and N'^ appearing with 
positive probability in the process generated by the modified partition. Consider 
an .T G X. By crgodicity, B' occurs almost surely in the P'-namc of x at some 
position n. The block B' is long enough so that at least one r-marker occurs within 
its length. Next to this marker B' contains one of the blocks Wk ■ Because Wk does 
not appear in any sector other then the sector number k (of an interval between 
two ri-markers) B' also can occur only in sectors that carry the number k. In the 
■P-name of x (i.e., before modification) at the position n there is some block B and 
also there is a fixed positioning of r-markers along this block, i.e., T"(x) belongs 
to some Bi. We know that the orbit of x visits Bi at least | times in each sector, 
except the cases when at the beginning of the sector the orbit falls into Z (which 
happen with probability < |). If this sector happens to be the sector number k, 
in the modified partition every such visit generates an occurrence of B' . Thus we 
conclude the following: The block B' occurs in the 7-"-name of every x G Z at least 
I times in more than | of all sectors number k and zero times in sectors carrying 
other numbers (see Figure 2; stars indicate the ri-markers). 



s.l s.2 



s. k 



s.K 



. 1 s.2 



. |b'b'b' 
s. k 



Figure 2: The distribution oj the occurrences of B' . 



It remains to compute the intensity, normalize the waiting time, and prove that 
Fb' (e) < • The intensity of the occurrences of B' is at least | j^jg = j^jj ■ ^^^^ 
multiplier | takes into account the visits in Z). The waiting time for the signal is 
smaller than M only when the zero coordinate falls in a sector number k or in the 
preceding sector. This happens with probability at most -j^ — ■ Otherwise the 
normalized waiting time is larger than M-tItj = e. This ends the proof. □ 



Clustering in an empirical process 

One of the multiple illustrations of the law of series appears in the precise study 
of the behavior of an "underwater glider" , one of the newest inventions in marine 
technology. The underwater glider is an autonomous robot able to cross thousands 
of miles without any propeller [B-D-H-L]. By changing the buoyancy of the ballasts, 
the vehicle tends to "fly" alternatively downwards or upwards to the surface. As 
the vehicle is equipped with wings - hence has some gliding ability, the variations 
of buoyancy are changed into horizontal motion. Very little energy is needed to 
modify the buoyancy and, as a consequence, the linear flight between the top and 
bottom points of the trajectory of the glider is costless in terms of energy (see 
Figure 3). 




This is the positive aspect of this brand new technology. However, during our 
numerous and lengthy tests at the sea we discovered some strange side effects, 
mainly linked to the fact that the glider flies at a very low speed and therefore is 
very sensitive to hydrodynamic phenomena. 

It was discovered [B-D-H-L] that even though one makes no doubt that the flow 
around the glider is linear, and the physical system (glider plus flow) is not chaotic 
(zero entropy), some heavy turbulences appear during short periods of time. They 
are encountered under a conjunction of circumstances with on ocean scale are pretty 
rare. The point is that it has been observed that these rare circumstances produce 
heavy turbulences, and those turbulences have a tendency to repeat overexpectedly. 
After a while, the stable flow returns. The authors decided to call theses turbulences 
"germs" , due to some similarities with the fact that the cavitations originating the 
perturbations could be activated by some "germs" in the water, and "pulsed out" 
somewhat once at a time. 

The authors were unable to predict, and still aren't, when and why these turbu- 
lences appear in a given homogeneous layer of water in the ocean. What they know 
by experience, is that once an unexpected turbulence appears, then others follow 
repeatedly with short gaps in time until the series stops for a new long period of 
smooth linear flow. This corresponds exactly to the pattern described as "strong 
clustering" . 

This series type behavior of appearances of turbulences has been integrated so 
as to produce, when the first turbulence of a series appears, some warning, and 
consequently for autonomous vehicles, the control law of the underwater vehicle 
has been reinforced in these flight specific phases. 

The next step would be to produce some statistical inference, and derive from 
such risk estimation for stability of trajectory under reinforced control laws of 
different levels of security. This has of course immediate industrial consequences. 
But it is a difficult mathematical challenge. 
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